首页> 外文OA文献 >Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation
【2h】

Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation

机译:投影中变量重要性(VIP)与变量选择和解释的选择性比(SR)方法的比较

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This study compares the application of two variable selection methods in partial least squares regression (PLSR), the variable importance in projection (VIP) method and the selectivity ratio (SR) method. For this purpose, three different data sets were analysed: (a) physiochemical water quality parameters related to sensorial data, (b) gas chromatography-mass spectrometry (GC-MS) chemical (organic compound) profiles from fossil sea sediment samples related to sea surface temperature (SST) changes, and (c) exposed genes of Daphnia magna female samples related to their total offspring production. Correlation coefficients (r), levels of significance (p-value) and interpretation of the underlying experimental phenomena allowed the discussion about the best approach for variable selection in each case. The comparison of the two variable selection methods in the first water quality data set showed that the SR method is more accurate for sensorial prediction. For the climate data set, when raw total ion current (TIC) GC-MS chromatograms were considered, variables selected using the VIP method were easier to interpret compared with those selected by the SR method. However, when only some chromatographic peak areas (concentrations) were considered, the SR method was more efficient for prediction, and the VIP method selected the most relevant variables for the interpretation of SST changes. Finally, for the transcriptomic data set, the SR method was found again to be more reliable for prediction purposes.
机译:本研究比较了两种变量选择方法在偏最小二乘回归(PLSR),投影中变量重要性(VIP)方法和选择性比(SR)方法中的应用。为此,分析了三个不同的数据集:(a)与感官数据有关的理化水质参数,(b)来自与海有关的化石海沉积物样品的气相色谱-质谱(GC-MS)化学(有机化合物)剖面表面温度(SST)的变化,以及(c)大型蚤(Daphnia magna)雌性样品的暴露基因与其后代总产量有关。相关系数(r),显着性水平(p值)和对潜在实验现象的解释使我们可以讨论每种情况下最佳变量选择方法。在第一个水质数据集中两种变量选择方法的比较表明,SR方法对于感官预测更为准确。对于气候数据集,当考虑原始总离子流(TIC)GC-MS色谱图时,与使用SR方法选择的变量相比,使用VIP方法选择的变量更易于解释。但是,当仅考虑某些色谱峰面积(浓度)时,SR方法的预测效率更高,而VIP方法选择了最相关的变量来解释SST变化。最后,对于转录组数据集,再次发现SR方法对于预测目的更为可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号